Modeling Sequential Domain Shift through Estimation of Optimal Sub-spaces for Categorization
Abstract
Domain adaptation (DA) is the process in which labeled training samples available from one domain are used to improve the performance of statistical tasks performed on test samples drawn from a different domain. The domain from which the training samples are obtained is termed the source domain, and its counterpart, consisting of the test samples, is termed the target domain. A few unlabeled training samples are also taken from the target domain in order to approximate its distribution. In this paper, we propose a new method of unsupervised DA in which a set of domain-invariant sub-spaces is estimated using the geometrical and statistical properties of the source and target domains. This is a modification of the work of Gopalan et al. [2], where the geodesic path from the principal components of the source to those of the target is considered on the Grassmann manifold, and intermediary points are sampled to represent the incremental change in the geometric properties of the data between the source and target domains. Instead of the geodesic path, we consider an alternate path of shortest length between the principal components of the source and target, with the property that the intermediary sample points on the path form domain-invariant sub-spaces in the sense of the Maximum Mean Discrepancy (MMD) [3]. Thus we model the change in the geometric properties of the data in both domains sequentially, such that the distributions of the projected data from both domains always remain similar along the path. The entire formulation is carried out in the kernel space, which makes it more robust to non-linear transformations.
Let X and Y be the source and target domains, containing n_X and n_Y instances respectively. If Φ(·) is a universal kernel function, then in kernel space the source and target domains are Φ(X) ∈ R^{n_X × d} and Φ(Y) ∈ R^{n_Y × d} respectively. Let K_XX and K_YY be the kernel Gram matrices of Φ(X) and Φ(Y) respectively.
Let D = [X; Y] denote the combined source and target domain data, and let Φ(D) be the corresponding data in kernel space. The kernel Gram matrix formed using D is given by K_DD = Φ(D)Φ(D)^T = [K_XX, K_XY; K_YX, K_YY], where K_XY = Φ(X)Φ(Y)^T and K_YX = K_XY^T.
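The quantities defined above can be illustrated numerically. The following is a minimal sketch, not the paper's implementation. It assumes a Gaussian (RBF) kernel with a hypothetical bandwidth parameter `gamma`, synthetic data, and the standard biased empirical estimate of the squared MMD. The names `rbf_gram`, `mmd2_biased`, and `K_DD` are chosen here for illustration only. The sketch builds K_XX, K_YY, and the cross-domain block K_XY, then checks that the Gram matrix of the stacked data D = [X; Y] has these matrices as its blocks.

```python
import numpy as np


def rbf_gram(A, B, gamma=0.1):
    """Gaussian (RBF) kernel matrix with entries exp(-gamma * ||a_i - b_j||^2)."""
    sq_dists = (
        (A ** 2).sum(axis=1)[:, None]
        + (B ** 2).sum(axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def mmd2_biased(K_XX, K_YY, K_XY):
    """Biased empirical estimate of the squared MMD from Gram matrix blocks."""
    return K_XX.mean() + K_YY.mean() - 2.0 * K_XY.mean()


# Synthetic source and target samples (assumed sizes and distributions).
rng = np.random.default_rng(0)
n_X, n_Y, dim = 40, 30, 5
X = rng.normal(loc=0.0, scale=1.0, size=(n_X, dim))  # source domain
Y = rng.normal(loc=0.5, scale=1.0, size=(n_Y, dim))  # shifted target domain

K_XX = rbf_gram(X, X)
K_YY = rbf_gram(Y, Y)
K_XY = rbf_gram(X, Y)

# Gram matrix of the combined data D = [X; Y]; its blocks are the matrices above.
D = np.vstack([X, Y])
K_DD = rbf_gram(D, D)

mmd2_xy = mmd2_biased(K_XX, K_YY, K_XY)  # discrepancy between the two domains
mmd2_xx = mmd2_biased(K_XX, K_XX, K_XX)  # a domain compared with itself
print("MMD^2(X, Y):", mmd2_xy)
print("MMD^2(X, X):", mmd2_xx)
```

A small MMD value between two projected samples indicates that their distributions are close in the kernel-induced feature space, which is the sense in which the intermediate sub-spaces are described as domain invariant.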